On Bayesian Model Assessment and Choice Using cross-Validation Predictive Densities

نویسندگان

  • Aki Vehtari
  • Jouko Lampinen
چکیده

We consider the problem of estimating the distribution of the expected utility of the Bayesian model (expected utility is also known as generalization error). We use the cross-validation predictive densities to compute the expected utilities. We demonstrate that in flexible non-linear models having many parameters, the importance sampling approximated leave-one-out cross-validation (IS-LOO-CV) proposed in (Gelfand et al., 1992) may not work. We discuss how the reliability of the importance sampling can be evaluated and in case there is reason to suspect the reliability of the importance sampling, we suggest to use predictive densities from the k-fold crossvalidation (k-fold-CV). We also note that the k-fold-CV has to be used if data points have certain dependencies. As the k-fold-CV predictive densities are based on slightly smaller data sets than the full data set, we use a bias correction proposed in (Burman, 1989) when computing the expected utilities. In order to assess the reliability of the estimated expected utilities, we suggest a quick and generic approach based on the Bayesian bootstrap for obtaining samples from the distributions of the expected utilities. Our main goal is to estimate how good (in terms of application field) the predictive ability of the model is, but the distributions of the expected utilities can also be used for comparing different models. With the proposed method, it is easy to compute the probability that one method has better expected utility than some other method. If the predictive likelihood is used as a utility (instead of application field utilities), we get the pseudo-Bayes factor as a special case. The proposed method can also be used to get samples from the distributions of the (prior), posterior, partial, fractional and intrinsic Bayes factors. As illustrative examples, we use MLP neural networks and Gaussian Processes (GP) with Markov Chain Monte Carlo sampling in one toy problem and two real world problems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Model Assessment and Comparison Using Cross-Validation Predictive Densities

In this work, we discuss practical methods for the assessment, comparison, and selection of complex hierarchical Bayesian models. A natural way to assess the goodness of the model is to estimate its future predictive capability by estimating expected utilities. Instead of just making a point estimate, it is important to obtain the distribution of the expected utility estimate because it describ...

متن کامل

Bayesian Input Variable Selection Using Posterior Probabilities and Expected Utilities

We consider the input variable selection in complex Bayesian hierarchical models. Our goal is to find a model with the smallest number of input variables having statistically or practically at least the same expected utility as the full model with all the available inputs. A good estimate for the expected utility can be computed using cross-validation predictive densities. In the case of input ...

متن کامل

Bayesian Input Variable Selection Using Cross-Validation Predictive Densities and Reversible Jump MCMC

We consider the problem of input variable selection of a Bayesian model. With suitable priors it is possible to have a large number of input variables in Bayesian models, as less relevant inputs can have a smaller effect in the model. To make the model more explainable and easier to analyse, or to reduce the cost of making measurements or the cost of computation, it may be useful to select a sm...

متن کامل

On Bayesian Model Assessment and Choice Using cross-Validation Predictive Densities: Appendix

Short description of the prior and the MCMC specification details is given here. See (Neal, 1996, 1997, 1999; Lampinen & Vehtari, 2001) and the FBM software manual (Neal, 2000) for additional details. The MCMC sampling was done with the FBM1 software and Matlab-code derived from the Netlab2 toolbox. FBM was used when possible (normal and logistic models) because of its speed. Matlab-based imple...

متن کامل

Mcmc Methods for Bayesian Model Comparison with Censored Multivariate Data

Model comparison and selection are among the most common problems of statistical practice, with numerous procedures for choosing “the best” among a class models proposed in the literature (see Kadane and Lazar (2001) for a recent review). However, comparing and selecting among richly parameterized Bayesian hierarchical models challenge standard procedures. Formal Bayesian methods for model sele...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001